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ABSTRACT 

This  study  uses  the  scientific  and  patent  literature  as  a  source  of  data  to 
analyze  the  relationship  between  author! inventor  contribution  spans  and 
the  rate  of  technological  progress  in  two  chemical  fields.  Using  survival 
analysis  statistics,  the  authors  examine  the  probability  that  an  individual 
will  contribute  to  the  field  for  a  specified  length  of  time  and  the  probability 
that  an  individual,  having  contributed  to  the  field  for  a  specified  period  of 
time,  will  cease  to  contribute  in  the  future.  The  authors  aLo  test  the 
significance  of  several  covariates  in  predicting  the  length  of  contribution 
spans. 


introduction 

Predicting  the  rate  of  technological  progress  within  a  given  field  is  an  enduring 
problem  for  those  individuals  who  are  responsible  for  the  allocation  of  scarce  resources. 
Ideally,  if  managers  and  government  policy-makers  had  at  their  disposal  an  array  of 
indicators  to  enable  them  to  predict  the  rate  of  technological  progress  in  a  field,  optimal 
resource  allocation  would  be  assured.  Indeed,  the  ability  of  managers  and  policymakers  to 
comprehend  the  pace  and  the  direction  of  technological  advancement  will  largely 
determine  their  firm's  or  nation's  competitive  performance  in  world  markets  into  the  next 
century.  This  is  no  small  task.  Historical  accounts  of  industrial  evolution,  such  as  with  the 
development  of  semiconductors,  videocassette  recorders,  and  personal  computers,  show  the 
immense  difficulties  some  firms  encounter  when  conft-onted  by  new  technologies  [1 ,  2,  3]. 
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Undoubtedly,  there  is  an  obvious  need  to  enhance  our  understanding  of  the  way  in  which 
new  technologies  emerge.  To  this  end,  different  methodologies,  ranging  from  qualtitative 
case  studies  to  sophisticated  quantitative  forecasting  models,  have  been  developed  [4,  5,  6, 
7].  These  developments  have  established  technological  forecasting  as  an  academic 
discipline  in  its  own  right.  However,  the  many  pitfalls  characteristic  of  technological 
forecasts  and  the  often  limited  usefulness  of  their  outcomes  have  been  noted  with  striking 
regularity  [8,  9,  10].  As  a  consequence,  technological  forecasters  face  a  dilemma.  On  the 
one  hand,  indicators  of  technological  progress  have  often  been  illusive.  On  the  other  hand, 
intense  global  competition  in  industry  and  constrained  government  budgets  for  science  and 
technology  have  made  the  search  for  such  indicators  all  the  more  urgent. 

One  approach  to  understanding  the  rate  of  technological  progress  within  a  given  field  is 
to  focus  on  the  aaions  of  those  individuals  who  are  responsible  for  creating  progress; 
namely,  the  scientists  and  engineers  who  solve  the  problems  that  enable  the  technology  to 
move  forward.  The  basic  rationale  for  this  perspective  follows  that  in  the  process  of  solving 
problems  at  the  technological  frontier,  researchers  continually  formulate  their  own  opinions 
regarding  the  rate  of  progress  and  the  probability  of  success  within  a  particular  field.  Based 
upon  their  assessment  and  reassessment  of  a  particular  field,  researchers  will  decide  (or 
influence  the  decision)  where  to  best  apply  their  energies. 2  Simply  stated,  if  progress  is 
rapid,  researchers  will  be  more  likely  to  stick  with  their  research  agenda  long  enough  to 
reap  the  rewards  of  their  work.  Conversely,  if  progress  is  slow,  researchers  will  be  more 
likely  to  seek  out  more  promising  research  areas  where  they  can  apply  their  energies.  As  a 
result,  it  may  be  possible  to  ascertain  the  relative  rate  of  progress  in  a  field  by  examining 
the  duration  of  the  contributions  made  by  those  individuals  who  are  actively  involved  in 
shaping  its  technological  progress. 

The  following  study  focuses  on  how  long  researchers  persist  in  a  field  in  order  to  assess 
the  relative  rate  of  technological  progress.  Scientific  and  patent  literature  are  used  as  a 
source  of  data  to  measure  the  length  of  time  that  individual  authors  and  inventors 

^This  basic  rationale  has  been  a  recurring  theme  among  many  sociologists  of  the  sciences,  who  have  come  to  view 
scientists  as  "investors  of  credibility.'  That  is,  scientists    are  likely  to  invest  their  credibility  in  those  specialties  where 
they  hypothesize  the  probability  of  reaping  rewards  to  be  the  highest  [11]. 


contribute  to  the  field  and  to  determine  statistically  the  survival  and  hazard  rates  as  well  as 
some  of  the  factors  associated  with  longevity. 3  In  panicular,  the  duration  of  the 
panicipation  of  individuals  is  examined  through  an  analysis  of  their  "contribution  spans"  in 
a  field:  that  is,  the  time  span  between  their  first  and  last  paper  or  patent  contribution.  From 
an  analysis  of  the  contribution  spans,  estimates  are  made  ofi  (1)  the  probability  that  an 
author/inventor's  contribution  span  will  extend  a  given  number  of  years,  and  (2)  the 
probability  that,  having  contributed  a  given  number  of  years,  an  author/inventor  will  cease 
to  contribute  in  the  future.  Furthermore,  the  relevance  of  a  number  of  covariates  in 
prediaing  the  duration  of  author/inventors'  contribution  spans  is  examined.'* 

Catalysts  for  two  stereoregular  polymers — EPDM  and  polypropylene — were  selected 
as  the  technical  fields  for  comparative  analysis  in  this  study.  5  The  choice  of  EPDM  (a 
synthetic  rubber)  and  polypropylene  (a  plastic)  as  comparative  test  cases  was  based  solely 
on  the  independent  evaluation  of  individuals  in  the  chemical  industry  that  these  fields 
experienced  markedly  different  rates  of  technical  progress  over  the  past  several  decades. 
Indeed,  the  historical  record  reveals  that  since  the  discovery  of  the  Ziegler-Natta  catalyst 
process  in  the  1950s,  ft-om  which  both  processes  are  based,  the  rate  of  catalyst  development 
in  each  field  diverges  significantly.  In  the  case  of  polypropylene,  four  major  breakthroughs 
have  lead  to  the  development  of  three  well-defined  generations  of  catalysts.  In  contrast, 
EPDM  catalysts  have  evolved  less  rapidly.  No  major  breakthroughs  have  occurred,  and  in 
the  past  fifteen  years  no  radically  new  processes  have  been  developed.  The  catalysts  used  in 
EPDM  are  generally  considered  to  be  first  generation  technology.^ 

By  comparing  the  contribution  spans  of  EPDM  researchers  and  polypropylene 
researchers,  some  insight  may  be  gained  into  the  question  of  whether  or  not  the  duration  of 
their  participation  in  the  field  can  serve  as  an  accurate  indicator  of  the  rate  of  technological 


^Tlic  methodology  is  discussed  in  [12]. 


^For  lack  of  a  better  term,  we  will  refer  to  these  individuals  as  author/inventors  in  order  to  erKomp>ass  individuals 
whose  contributions  may  include  scientific  publications  or  patents. 

5Tlie  historical  development  of  EPDM  and  polypropylene  is  described  in  [13,  14,  15]. 

*It  must  be  noted  that  although  the  fields  realized  different  rates  of  technical  progress  in  catalyst  development,  this 
does  not  imply  that  one  field  is  necessarily  unsuccessful  in  a  commercial  sense.  Both  fields  arc  considered  commercial 
successes,  although  polypropylene  has  become  much  more  widely  used. 


progress.  It  is  expected  that  EPDM  researchers  would  be  more  likely  to  leave  their  field 
sooner  than  their  counterparts  in  polypropylene. 


DATA  COLLECTION  AND  METHODS 

Commercial  electronic  databases  were  used  to  identify  patents  and  publications 
related  to  the  fields  of  EPDM  and  polypropylene  catalyst  development.  The  databases 
were  searched  on-Une  using  a  set  of  key  terms  that  are  known  to  be  commonly  used  in  the 
lexicon  of  author/inventors  and  might  be  either  in  the  title,  abstraa  or  classification  terms 
of  a  document.  The  searches  resulted  in  the  retrieval  of  1,383  polypropylene-  and  613 
EPDM-related  patents  and  publications  between  1955  and  1989.  In  each  case,  the  majority 
of  documents  retrieved  were  patents  (60%  of  the  polypropylene  documents  and  78%  of  the 
EPDM  documents). 

The  documents  were  retrieved  electronically  and  were  temporarily  placed  in  a 
bibliographic  relational  database  operating  on  a  personal  computer.  This  allowed  for  a 
careful  inspeaion  of  each  document  in  order  to  ensure  the  accuracy  and  integrity  of  the 
search  procedure.  Since  multiple  source  databases  were  used,  it  was  necessary  to  remove 
duplicate  documents.  In  addition,  while  inspecting  the  database,  an  effort  was  made  to 
remove  misclassified  documents  that  did  not  pertain  to  EPDM  or  polypropylene  catalyst 
development.  In  the  process  of  inspeaing  the  documents,  any  that  seemed  inappropriate 
were  flagged,  so  that  an  individual  active  in  these  areas  could  make  the  final  judgment  as 
to  its  relevance.  Furthermore,  to  avoid  problems  of  incompleteness  in  the  later  years  due  to 
patent  lags,  as  well  as  in  the  earlier  years,  the  databases  were  constrained  to  the  period 
fi-om  1960  to  1986. 

The  data  collection  procedure  described  above  ultimately  resulted  in  the 
identification  of  613  EPDM  and  1,314  polypropylene  patent  applications  and  publications 
from  1960  to  1986.  The  contribution  span  data  subsequently  used  in  this  study  were 
derived  from  these  documents.  However,  before  the  documents  could  be  used  as  a  source  of 
data,  they  required  extensive  editing  in  order  to  create  consistency  among  author/inventor 


names  and  their  affiliation  names.  It  is  frequently  the  case  with  commercial  databases  that 
the  name  of  an  author  or  an  affiliation  is  not  standardized  across  documents.  Sometimes 
the  inconsistencies  arise  because  of  misspellings,  but  mostly  they  are  the  result  of  variations 
in  the  use  of  abbreviations,  middle  initials,  capitalizations  and  hyphenations.  Although  such 
a  lack  of  standardization  might  not  be  a  problem  for  the  typical  user  of  an  electronic 
literature  database,  it  would  be  a  major  source  of  error  in  determining  the  duration  of 
author/inventor  contribution  spans.  Therefore,  it  was  essential  to  meticulously  inspect  the 
name  of  each  author  and  affiliation  in  the  relational  database  so  that  all  inconsistencies 
could  be  eliminated.  A  particularly  unfortunate  complication  specific  to  the  patent 
literature  databases  is  the  frequent  absence  of  inventor  names  from  corporate  patents.  This 
required  the  use  of  multiple  databases  and  the  cross-checking  of  patent  numbers  in  order  to 
obtain  the  missing  data. 

Upon  completing  the  editing  of  the  documents,  the  database  was  used  to  identify  each 
author/inventor  who  contributed  to  the  field  over  the  twenty-seven  year  period.  This 
procedure  yielded  a  total  of  3,280  individuals.  At  this  stage,  a  statistical  database  was 
created  containing  several  covariates  for  each  author/inventor  that  were  derived  from 
information  obtained  from  the  published  documents  as  well  as  other  sources.  Table  1 
provides  a  list  of  the  variables  and  their  definitions. 

The  dependent  variable  for  the  analysis,  the  contribution  span,  is  calculated  as  the 
number  of  years  that  have  elapsed  from  the  first  to  the  last  known  patent  or  publication  for 
each  author.7  Although  calculating  the  contribution  span  is  relatively  straightforward,  there 
arc  some  methodological  issues  that  arise  that  require  ftarther  explanation.  The  primary 
issue  of  concern  is  that  for  those  author/inventors  who  are  active  in  the  fields  of  EPDM  and 
polypropylene  during  the  last  year  of  the  database,  the  ultimate  length  of  their  contribution 
span  is  indeterminate.  In  other  words,  since  these  individuals  have  not  yet  left  the  field,  it 
is  only  known  that  the  length  of  their  contribution  span  is  some  minimum  value  (that  is,  the 
entry  year  to  the  present  year).  To  account  for  this,  survival  analysis  statistics  were 
implemented   in   analyzing   the  data   [16,    17,    18,    19].   Such   techniques   take   into 

^  For  example,  if  a  researcher  first  published  in  1975  and  last  published  in  1980,  the  researcher's  contribution  span 
would  be  caJcuiated  as  six  years.  Furthermore,  it  is  assumed  that  a  researcher  who  publishes  in  only  one  year  has  a  span  of 
one  year.   Note  that  the  contribution  span  is  unaffected  by  the  frequency  of  publication  within  a  given  year. 


consideration  precisely  this  kind  of  problem  in  the  calculations  with  a  procedure  that 
adjusts  for  the  biases  that  right-censored  data  create. 

Having  determined  the  distribution  of  contribution  spans,  it  is  interesting  to  examine 
what  faCTors  might  affea  how  long  an  author/inventor  contributes  to  the  field.  Using  the 
literature,  a  number  of  covariates  were  construaed.  Although  they  could  be  treated  a5  time- 
varying  covariates  (that  is,  as  having  values  that  vary  yearly  in  the  course  of  an  author's 
contribution  span),  the  present  analysis  does  not  implement  such  an  approach  to  formulating 
the  data  set.  Therefore,  the  value  for  each  covariatc  is  taken  according  to  the  last  year  of  the 
author's  contribution  span.  In  this  manner,  several  covariates  were  created,  including  two 
dummy  variables  to  control  for  factors  that  might  account  for  heterogeneity  within  the 
population  examined.  First,  the  kind  of  organization  in  which  each  author  is  employed  was 
coded  according  to  whether  they  are  employed  in  an  industrial  or  non-industrial  (i.e., 
academic  or  government)  research  laboratory.  Second,  the  country  in  which  the  author  is 
located  was  coded,  and  a  covariate  was  created  to  signify  whether  the  individual's 
affiliation  is  located  in  a  Western  industrial  country  or  in  an  Eastbloc  country. 

Additional  covariates  were  created  which  reflect  individual,  organization  or 
population  anributes.  At  the  individual-level,  a  covariate  was  constructed  to  reflect  an 
author's  productivity  in  the  field  as  measured  by  the  cumulative  number  of  publications 
and  patents  to  their  credit.  An  organizational-level  covariate  was  created  to  reflect  the 
produaivity — or  what  might  be  considered  as  an  organization's  cumulative  investment  in 
the  field — in  terms  of  the  cumulative  number  of  patents  and  publications  assigned  to  the 
author/inventor's  affiliation. 

Three  population-level  covariates  were  created  to  reflect  the  size  and  dispersion  of 
each  field  in  each  year.  Population  size  is  measured  in  terms  of  the  number  of  individuals 
who  publish  or  patent  in  the  field  in  a  given  year,  as  described  below  and  illustrated  in 
Figure  1.  A  second-order  covariate,  the  square  of  population  size,  was  created  in  order  to 
capture  any  quadratic  association  between  population  size  and  contribution  spans.  The  third 
covariate  is  a  measure  of  dispersion  of  authors/inventors  among  different  organizations: 
that  is,  the  extent  to  which  the  population  is  concentrated  in  a  few  organizations  or  spread 


across  many.  For  this  purpose,  a  Hirfindahl  concentration  statistic,  which  is  determined  by 
calculating  the  sum  of  the  squared  share  of  author/inventors  affiliated  with  each 
organization  annually,  is  used. 

Lastly,  two  covariates  were  included  to  reflect  the  maturity  of  the  field.  The  first  is 
market  size  (in  terms  of  kilotons  of  EPDM  and  polypropylene  produced  annually).  Because 
statistics  for  total  world  produaion  are  not  available  over  the  entire  twenty-seven  years,  the 
aggregate  produaion  of  Western  Europe,  Japan  and  the  U.S.  is  used  instead. 8  The  second 
maturity  covariate  is  the  cumulative  number  of  patents  granted  in  each  field. 

RESULTS 

Using  data  from  the  scientific  and  patent  literature  on  EPDM  and  polypropylene 
published  between  I960  and  1986,  the  contribution  spans  for  3,280  author/inventors  and 
several  explanatory  variables  associated  with  each  were  compiled  into  a  statistical 
database:  2267  (69%)  individuals  in  polypropylene  and  1013  (31%)  in  EPDM.  Of  the 
3,280  cases,  739  (22.5%)  were  aaive  the  last  three  years  of  the  database  and  were  therefore 
classified  as  censored.  Sixty  percent  of  the  total  population  are  employed  in  industrial 
firms  and  nearly  twenty-five  percent  are  located  in  Eastbloc  countries. 

The  historical  growth  in  participation  in  each  field  can  be  seen  clearly  in  terms  of  the 
number  of  author/inventors  contributing  to  the  literature  (see  Figure  1).  The  number  of 
researchers  in  each  field  in  a  given  year  is  calculated  to  be  the  cumulative  number  of 
individuals  entering  the  field  (as  evidenced  by  an  initial  publication  or  patent  application) 
subtracted  by  the  cumulative  number  of  individuals  who  have  left  the  field  (as  evidenced 
by  their  failure  to  continue  to  publish  or  patent  in  a  future  year). 

The  data  were  analyzed  using  the  LIFETEST  and  LIFEREG  procedures  of  SAS  (v5.18). 
Using  the  LIFETEST,  the  first  step  in  the  analysis  was  to  make  non-parametric  estimates  of 
the  survival  and  hazard  functions  for  the  data.  The  lifetable  approach  was  chosen.  The 


^The  market  data  were  provided  by  the  marketing  research  department  of  a  major  chemical  firm.   The  data 
were  checked  for  accuracy  with  dau  from  Kline  &C  Co..  an  organization  that  publishes  statistics  on  the  chemical  industry. 


results  of  this  procedure  are  illustrated  in  Figure  2.  TTie  survival  funaion  for  each  field  is  a 
monotone  decreasing  function  and  are  nearly  identical.  Tests  of  homogeneity  of  the 
survival  curves  stratified  by  field  can  not  reject  the  null  hypothesis  that  the  strata  have 
identical  survival  distributions. The  probability  of  a  author/inventor's  contributions  span 
lasting  two  years  or  longer  is  about  0.3.  After  two  years,  the  survival  rate  continues  to 
diminish,  eventually  leveling-off  at  about  0.07  for  contribution  spans  of  15  years  or  more. 
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FIGURE  1 :      Growth  in  the  EPDM  and  Polypropylene  Author/ Inventor  Populations,  1960-86 


The  hazard  functions  are  non-monotonic  and  initially  very  different  for  each  field.  In 
both  cases  the  hazard  rate  decreases  markedly  for  author/inventors  who  have  contribution 
spans  of  at  least  two  years.  In  the  case  of  EPDM,  however,  the  probability  of  a 
author/inventor  ceasing  to  contribute  after  having  contributed  for  two  years  is  0.25.  In 
comparison,  for  polypropylene,  the  probability  of  a  author/inventor  ceasing  to  contribute 
after  having  contributed  for  two  years  is  0.16.  Thus,  the  hazard  rate  in  the  second  year  is 
1.56  times  higher  in  EPDM  than  in  polypropylene,  and  this  difference  is  found  to  be 
statistically  significant  (95%  confidence  interval).  Beyond  the  second  year,  there  is  no 
statistically  significant  difference  between  the  functions.  For  both  fields  the  hazard  rate 
varies  between  about  .05  and  0.15  between  the  second  and  tenth  year.  The  basic  implication 
of  the  hazard  funaion  is  that  the  longer  an  author/inventor  contributes  to  the  field,  the  less 


likely  he  or  she  will  be  to  leave  it,  with  the  first  and  sixth  years  being  particularly  critical 
points.  Indeed,  the  risk  of  leaving  the  field  is  highest  within  the  first  year. 
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FIGURE  2:   Non-Parametric  Estimates  of  Survival  and  Hazard  Functions  far  Author! Inventor  Contribution 
Spans  in  EPDM  and  Polypropylene 


The  next  step  in  the  analysis  wa5  to  determine  the  parametric  model  that  best  fits  the 
distribution  of  contribution  spans.  Although  non-parametric  analysis  permits  certain 
assumptions  that  can  be  made  about  the  shape  of  the  survival  distribution  (for  instance,  that 
it  is  non-monotonic),  nonetheless  we  decided  to  examine  statistically  several  different 
distributions  for  goodness  of  fit.  The  basic  model  adopted  for  the  analysis  is: 

Y  =  Xp  +  ae 


where  Y  is  the  log  of  the  contribution  span,  X  is  the  matrix  of  covariates,  P  is  a  veaor  of 
unknown  regression  parameters,  a  is  a  scale  parameter  and  £  is  a  vector  of  errors  from  an 
assumed  distribution.  This  model  is  referred  to  as  an  accelerated  failure  time  model 
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because  the  effect  of  the  explanatory  variables  is  to  scale  a  baseline  distribution  of  failure 
times.  In  order  to  determine  the  underlying  distribution  that  best  fits  the  data,  four 
different  types  of  distributions  were  evaluated:  the  exponential,  Weibull,  gamma,  and  log- 
logistic  distributions.  Using  LIFEREG,  the  results  of  this  procedure  for  the  entire  sample 
are  provided  in  Table  2.  The  parameters  are  estimated  by  maximum  likelihood  using  a 
Newton-Raphson  algorithm.  The  overall  fit  of  each  model  is  represented  by  the  log- 
likelihood  funaion.  Minus  two  times  the  log-likelihood  value  has  a  chi-square  distribution 
with  appropriate  degrees  of  freedom.  Using  the  baseline  model,  the  goodness  of  fit  for 
each  distribution  is  evaluated  in  term  of  minimizing  the  absolute  value  of  the  log- 
likelihood  score.  As  a  result,  the  log-logistic  distribution  (with  a  log-likelihood  score  of 
-1287)  was  chosen  and  became  the  basis  for  estimating  the  regression  coefficients  of  the 
explanatory  variables  in  the  model.  This  result,  which  is  consistent  with  the  non-parametric 
findings,  suggests  a  non-monotonic  hazard  funaion. 

Table  2  also  shows  the  results  of  the  inclusion  of  a  dummy  variable  (Technical  Field) 
to  stratify  on  the  two  fields  being  investigated.  The  estimates  for  the  log-logistic 
distribution  indicate  no  field  effect.  There  is  no  evidence  to  suggest  that  the  distribution 
of  contribution  spans  for  the  two  fields  are  divergent,  which  confirms  the  findings  from  the 
non-parametric  analysis.  This  result  is  consistent  across  all  distributions. 

In  the  next  step,  the  model  was  estimated  with  LIFEREG  in  a  sequence  of  steps  by 
adding  each  covariate  into  the  equation  using  the  log-logistic  distribution.  Since  it  is 
desirable  to  investigate  whether  the  significance  of  covariates  differs  among  the  two  fields, 
the  modelling  results  are  presented  separately  (see  Tables  3a  and  b).  In  the  case  of  each 
field,  the  addition  of  each  covariate  has  the  effect  of  generally  improving  the  log- 
likelihood  score.  Model  9  was  chosen  as  the  baseline  for  the  comparison  to  understand  the 
effea  of  the  covariates. 

The  estimation  results  of  Model  9  indicate  a  number  of  differences  in  terms  of  the 
significance  of  the  covariates  examined.  First,  several  covariates  that  are  significant  in  the 
case  of  polypropylene  are  not  significant  for  EPDM:  namely,  organization  productivity  (+), 
industrial  (-),  patents  (+),  and  market  size  (+)  are  all  significant  for  polypropylene  but  not 
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for  EPDM.  Second,  although  the  population  size  variables  are  significant  in  both  cases,  the 
signs  of  the  coefficients  indicate  a  different  relationship  in  each  case.  The  only  similarities 
between  the  two  fields  are  the  significance  of  author/inventor  produaivity  (+),  and  the  lack 
of  significance  for  the  Eastbloc  and  concentration  covariates. 

In  the  case  of  the  population  variables,  the  first-  and  second-order  population  terms  are 
significant  right  from  their  initial  inclusion  in  both  fields.  However,  in  the  case  of  EPDM, 
the  negative  coefficient  for  population  size  combined  with  the  positive  coefficient  for  the 
second-order  term  implies  a  U-shaped  relationship  between  population  size  and 
author/inventor  contribution  spans  (see  Figure  3).  The  data  indicate  that  when  the 
population  is  small,  its  size  is  negatively  related  to  the  length  of  contribution  spans;  but 
after  it  reaches  the  size  of  about  1 50  individuals  the  slope  of  the  curve  turns  positive.  This 
result  suggests  that  there  may  be  a  point  of  critical  mass  for  the  EPDM  author/inventor 
population,  at  which  its  size  is  sufficiently  large  to  become  significant  in  increasing 
contribution  spans.  It  is  interesting  to  note  that  the  EPDM  population  never  grew  beyond 
this  size  (refer  back  to  Figure  1).  In  contrast  to  the  EPDM  case,  the  relationship  between 
population  size  and  the  length  of  contribution  spans  in  the  case  of  polypropylene  is 
negatively  sloped,  and  increasingly  so,  as  the  population  grows  larger. 
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FIGURE  3:     Relationship  between  Population  Size  and  Contribution  Span:  EPDM  and  Polypropylene 
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DISCUSSION 


Using  the  scientific  and  patent  literature  as  a  source  of  data,  this  paper  provides  an 
analysis  of  the  contribution  spans  of  author/inventors  in  the  field  of  EPDM  and 
polypropylene  catalyst  development.  Non-parametric  estimates  of  the  survival  rate  and 
hazard  rate  are  made,  and  it  is  found  that  the  distribution  of  contribution  spans  follows 
most  closely  a  log-logistic  function.  In  addition,  a  statistical  model  of  the  relationship 
between  the  hazard  rate  and  a  set  of  covariates  is  examined. 

The  findings  of  this  analysis  indicate  that  the  sample  survival  and  hazard  functions  for 
3,280  author/inventors  in  EPDM  and  polypropylene  are  generally  similar.  About  30- 
percent  of  the  author/inventors  have  a  contribution  span  of  more  than  two  years.  The  risk  of 
an  author/inventor  ceasing  to  contribute  to  the  field  is  greatest  in  the  first  year  of  their 
contribution  span.  The  hazard  rate  declines  sharply  after  the  first  year  and  subsequendy 
remains  fairly  constant.  The  critical  difference  between  the  two  fields  comes  in  the  second 
year,  when  the  hazard  for  EPDM  researchers  is  more  than  one-and-a-half  times  higher  than 
for  polypropylene  researchers.  This  result  is  suggestive  of  the  basic  hypothesis  that 
individuals  confronted  with  slow  progress  in  a  field  will  be  more  likely  to  seek  an 
alternative  area  of  research.  The  fact  that  it  is  the  early  years  that  are  most  critical  is 
understandable,  since  at  that  point  not  much  of  a  researcher's  career  has  been  invested  in  the 
field.  Needless  to  say,  the  longer  one  stays  in  the  field,  the  less  likely  they  are  to  leave  it, 
regardless  of  the  rate  of  progress. 

An  examination  of  relationship  between  several  covariates  and  the  length  of 
contribution  spans  indicates  that  EPDM  and  polypropylene  are  quite  different.  Most 
noteworthy  is  the  difference  in  the  relationship  between  population  size  and  contribution 
spans.  It  appears  that  in  the  rapidly  progressing  field  of  polypropylene  catalysts,  the  larger 
the  field  became,  the  shorter  and  shorter  the  duration  of  contribution  spans  became.  This  is 
perhaps  the  result  of  the  competitive  pressures  that  arise  as  more  people  work  in  a  field.  In 
contrast,  EPDM  shows  a  much  different  relationship,  suggesting  that  a  lack  of  individuals  in 
the  field  had  a  detrimental  effea  on  the  length  of  contribution  spans. 
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Another  important  difference  lies  in  the  efFea  of  market  size.  In  the  case  of  EPDM, 
market  size  is  not  significant  in  influencing  contribution  spans.  However,  in  the  case  of 
polypropylene,  market  size  is  significant  and  has  a  positive  effect  on  the  length  of 
author/inventor  contribution  spans.  This  is  suggestive  of  the  notion  that  rapidly  progressing 
fields  of  technology  may  have  an  imporunt  "market  pull"  component. 

The  present  analysis  has  certain  limitations,  some  of  which  are  peculiar  to  literature- 
based  studies  and  some  of  which  are  more  generic  in  nature.  Given  that  the  primary  interest 
in  this  research  is  to  understand  progress  in  a  field,  it  will  be  necessary  to  construa  a  data 
set  that  implements  a  time-varying  covariate  data  struaure.  Such  an  approach  will  permit  a 
more  careful  scrutiny  of  the  dynamic  phenomena  that  affect  an  author/inventor's 
contribution  spans.  Furthermore,  this  approach  will  allow  for  the  examination  of  whether  or 
not  changes  in  the  hazard  rate  of  a  community  can  serve  as  an  indicator  of  future  momentum 
of  the  field.  It  is  also  necessary  to  determine  the  extent  to  which  the  present  and  future 
findings  from  the  EPDM  and  polypropylene  field  can  be  generalized  to  other  fields  of 
science  and  technology  and  to  examine  the  importance  of  other  explanatory  variables  in 
understanding  contribution  spans. 

Work  is  currently  underway  to  address  these  issues.  First,  a  preliminary  investigation 
suggests  that  data  from  the  literature  is  structured  in  such  a  manner  that  time-varying 
covariates  should  be  feasible  to  create.  Second,  data  sets  for  ten  additional  fields  are 
currently  being  constructed,  with  fields  varying  in  terms  of  their  size  and  disciplinary 
composition,  the  national  and  sectoral  distribution  of  their  author/inventors,  their 
commercial  impact,  and  the  degree  to  which  they  have  succeeded  in  becoming  well- 
established,  institutionalized  research  communities.  Third,  funher  studies  will  be 
supplemented  with  other  data,  derived  both  from  the  literature  and  from  other  sources. 

We  believe  the  approach  towards  the  scientific  and  technological  literature  outlined  in 
this  paper  offers  new  perspectives  to  the  application  of  bibliometric  methods  to 
technological  forecasting.  Instead  of  predicting  the  growth  and  the  decline  of  particular 
fields  by  looking  at  publication  or  patent  volumes,  our  research  points  to  the  usefijlness  of 
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publication  and  patent  information  in  determining  the  contribution  spans  of  researchers. 

Analyzing  contribution  spans  may  eventually  serve  as  a  useful  tool  for  managers  and 
government  policy-makers  who  are  responsible  for  monitoring  the  progress  of  emerging 
technologies  globally.  In  essence,  we  are  proposing  a  technique  that  allows  one  to  gauge  the 
worldwide  pulse  of  technological  progress  by  measuring  the  rate  of  change  in  researchers' 
commitment  to  a  field.  By  focusing  on  the  determinants  of  researcher  contribution  spans, 
our  aim  is  to  shift  attention  away  from  prediaing  the  technological  future  and  towards 
understanding  the  underlying  fijndamentals  of  researcher  behavior.  Improvements  in  our 
understanding  of  survival  and  hazard  rates  for  researchers  in  a  field  may  ultimately  lead  to 
the  identification  of  critical  factors  and  events  that  can  inform  our  policy  decisions 
regarding  emerging  technologies.  In  this  manner,  we  suggest  that  measurements  of 
researchers'  persistence  using  contribution  spans  may  serve  as  an  indicator  of  change  in  the 
rate  of  technological  progress  in  a  particular  field.  Perhaps  we  can  most  clearly  envision 
our  technological  future  by  understanding  in  a  comprehensive  and  systematic  manner  the 
sustained  commitment  of  researchers  to  the  ideas  they  are  pursuing  today. 
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TABLE  1 
Variables  used  in  the  analysis  and  their  definitions 


CATEGORY 

VARIABLE 

DESCRIPTION 

Dependent  variable 

Contribution  span 

Span  of  years  since  researcher's  first  and  last  patent 
application  or  publication  in  the  field 

Covariates: 

Technical    field 

EPDM=0;  polypropylene=l 

Author/inventor  productivity 

Cumulative  number  of  patent  applications  or 
publications  in  the  field  by  the  author 

Organization  productivity 

Cumulative  number  of  patent  applications  and 
publications  in  the  field  by  the  author/inventor's 
organization 

Industrial 

Author/inventors  employed  in  industrial  firms  =  1; 
otherwise  =  0 

Eastbloc 

Author/inventors  from  Eastbloc  countries=  1; 
otherwise  =  0 

Market  size 

Annual  industrial  production  of  EPDM  and 
polypropylene  in  kilotons 

Patents 

Cumulative  number  of  patents  granted  in  the  field 

Population 

Total  number  of  inventors/authors  in  the  field 

Population^/lOOO 

Second  order  term  of  population  size 

Dispersion 

Index  of  organizational  dispersion  of 
inventors/authors  in  the  field  (Hirfindahl  index) 

TABLE  2 
ML  estimation  of  contribution  spans  using  different  distributions 


£!(p?n':n(i»l 

Wcibull 

Gamma 

Log-logistic 

Intercept 

-0.228' 
(0.133) 

^.458"' 
(0.068) 

-0.101 
(0.061) 

-0.261*** 

(0.042) 

Technical    field 

0.090 
(0.089) 

0.032 
(0.044) 

0.022 
(0.042) 

0.007 
(0.028) 

Author/inventor  producdviry 

0.487'" 
(0.022) 

0.671"* 
(0.016) 

0.349'" 
(0.007) 

0.419**' 
(0.009) 

Organization  productivity 

0.002' 
(0.001) 

0.002*" 
(0.001) 

0.002*** 
(0.001) 

0.002*** 
(0.000) 

Industrial 

-0.033 
(0.070) 

-0.050 

(0.034) 

-0.083*** 
(0.031) 

-0.047*' 
(0.021) 

Eastbloc 

-0.011 
(0.085) 

-O.080 
(0.042) 

-0.043 
(0.038) 

-0.015       ' 
(0.026) 

Market  size 

0.0004*" 
(0.000) 

0.000  r" 

(0.000) 

0.0002*** 
(0.000) 

0.0002'" 
(0.000) 

Patents 

0.005"* 
(0.000) 

0.003"* 
(0.000) 

0.002*** 
(0.000) 

0.001'" 
(0.000) 

Population 

^.005"' 
(0.001) 

-^.003"* 
(0.001) 

-0.003*** 
(0.001) 

-0.002"' 
(0.000) 

Population^/lOOO 

-0.024"* 
(0.003) 

-0.012"* 
(0.001) 

-0.011*" 
(0.001) 

-0.010'" 
(0.000) 

Dispersion 

-O.OOl' 
(0.001) 

-0.001"* 
(0.000) 

-0.001** 
(0.000) 

-o.ooo' 

(0.000) 

Scale  parameter 

0.493 
(0.007) 

0.465 
(0.000) 

0.197 
(0.004) 

Shape  parameter 

0.074 
(0.000) 

Log- Likelihood 

-2880 

-2047 

-1761 

-1287 

NOTES:  1.)      Total  number  of  cases  =   3139  number  of  right-ccnsorcd  values  =  739 

Z)      Figures  in  f>arcnthescs  arc  standard  errors  of  estimates 
3.)      Significance  level:   *<  .1     "<  .05    "'<  .01 
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